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Abstract 


Robotic Process Automation is a platform used to automate boring and repetitive computer processes 
using software bots so that humans could involve in tasks which include creativity and decision making 
which could not be done by robots. Optical Character Recognition takes out printed characters in an 
image and converts it to text. Google Tesseract OCR and Microsoft OCR were the commonly used OCR 
engines available in UiPath, a tool for Robotic Process Automation. In Previous, research on comparing 
those two open source OCR engine, there we made comparison on basic factors which included speed, 
hardware requirements, accuracy ,but in that case, accuracy was been calculated manually which gave us 
results but with less precise, as it was a manual process to substitute scraped data to that formulas, In this 
research we've made results with more precision by performing a String comparison algorithm named, 
“Levenshtein Distance Algorithm” which is deployed in UiPath. 
Keywords: Optical Character Recognition(OCR); Robotics Process Automation(RPA); Google Tesseract 
OCR; Microsoft OCR 


1. Introduction 

Robotics and automation has stepped into reality 
a few years ago and is evolving so rapidly around 
the world in areas such as industrial automation, 
space engineering, stellar space engineering, even 
in urban and rural areas all over the world. I 
always wonder how these programs’ work 
seamlessly 24 by 7 hours,[1]as it was designed to 
do that actually, As we use this technology to 
overcome bored repetitive tasks and human risky 
jobs, we often deploy them daily to do that kinda 
tasks, However they depend on humans to get 
engaged with that technology,[7] there are many 
myths in this century regarding future predictions 
about these technologies (mainly robots or 
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Artificial Intelligence) that theyll overrule the 
humans by making their own decisions and works 
which involve their own creativity. [4]But these 
theories clearly states that machines can’t stand a 
chance against human intelligence cause they’re 
still competing with our skills,[3]but there are 
many tasks which humans seek for technology 
help like repetitive tasks as mentioned above to 
make the result more efficient and with more 
precision, and there’s many investors currently 
investing billions of dollars in these kind of 
technologies to seek more compound profit in 
future. Microsoft OCR, which is a built in OCR 
engine in Microsoft windows 10 and Tesseract 
OCR,[2]an open source OCR engine developed 


98 


www.rspsciencehub.com 

by Google were the two available open source 
OCR engines in UiPath, a tool for Robotic 
Process Automation. In the previous 
paper[1l]research made is by checking the 
accuracy of Tesseract OCR and Microsoft OCR 
by using some manual methods, which is not 
precise. Also, we had used different sets of 
images for testing the accuracy and had also used 
systems of different specifications for this 
research which may result in error for time taken 
and accuracy percentage. Hence, to propose a 
more valid result, we had planned to improve our 
results by using a string comparison algorithm 
named, “Levenshtein algorithm’, which is used to 
calculate the similarity between two input strings 
and returns it's accuracy in percentage. We had 
also used the same set of images for testing both 
OCR engines. And executed the workflow on the 
same system in order to calculate the time taken 
for the execution error-free. 

2. String comparison - levenshtein distance 
algorithm 

The operation accepts two strings and returns the 
percentage of similarity between two _ strings 
using the Levenshtein Algorithm in_ the 
System.Single form. The Levenshtein algorithm 
(also referred to as Edit-Distance) calculates the 
minimum number of editing operations needed to 
change one string in order to obtain another 
string. The dynamic programming approach is the 
most prevalent way of measuring this. A matrix 1s 
initialized and the Levenshtein distance between 
the m-character prefix of one is calculated in the 
(m,n)-cell with the n-prefix of the other term. 
From the upper left to the lower right corner, the 
matrix can be filled. An insert or a delete, 
respectively, corresponds to each hop horizontally 
or vertically. The cost is typically set at 1 for each 
of the tasks. If the two characters in the row and 
column do not match or O, if they do the diagonal 
jump will cost either one. The expense is often 
reduced locally by each cell. This way, the 
Levenshtein gap between both terms is_ the 
number in the lower right corner. 

If it is needed to compare a string to some sample 
data, this could effectively be used. For example, 
the status of an application needs to be updated on 
the basis of a statement from Approvers. "The 
request is approved "Application is approved" The 
request raised 1s approved yesterday" Yesterday's 
request is approved. The traditional string 
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comparison methods would not operate in such 
instances, but the operation will give a percentage 
of similarity. 

3. Methodology 

In this research proposal, an string comparison 
algorithm plays an important role to give more 
accurate results than our previous study, the 
whole sequence of the execution will be like; 
unzipping and feeding the data from our local 
machine storage to the workflow ocr 
engines(either Microsoft OCR engine or 
Tesseract OCR Engine first) ; and redirecting the 
extracted data to the string algorithm analysing 
container as show at figure 2./ and there the 
major part plays on comparing the extracted data 
with the original data from the images which was 
used to feed the OCR engines, and eventually the 
accuracy(data) will be saved to local storage or 
can also use cloud storage for purpose, if we’re 
deploying this workflow to the UiPath 
Orchestrator. 

The main upgrades from the previous paper[1] 1s 
about: 

Same set of source data is supplied to both ocr 
machines to expect better comparison results 

Both workflows for ocr has been executed with 
same hardware equipments whereas used 
different hardwares for previous comparison[1 | 
Images with lighten backgrounds are used to 
extract more data to obtain more precise 
information. 

3.1 What does container refer to? 

Images with fancy or decorable fonts are used to 
test the algorithm [1]Containers or Blocks which 
is often used in uipath studio to classify set of 
activities or program in an order to execute in a 
sequential manner, if a container 1s set to top level 
node, the activities which is under that container 
executes first and then the workflow further 
moves to next container which holds set of 
instructions readily to run followed by the 
previous block. 

4. Architecture and workflow 

4.1 Architecture 

Some people might be good at programming by 
nature. But it's not necessary to be everyone. 
There are many people, who always struggle to 
grasp the basics of programming or simply they 
can't program. [1]Hence UiPath studio offers a 
no-code environment where we can enter with 
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minimal or zero programming background using 
some visual code blocks.[1]There were several 
boring and repetitive tasks in Information 
Technology and Business Process which may be 
mundane for many employees. Hence, we could 
deploy a bot so that humans could involve in 
some other creative activities and other activities 
which include human decisions in order to make 
an accurate process in a very low time.[1]The 
primary architecture of UiPath software consists 
of three components; UiPath Studio, UiPath 
Robot, UiPath Orchestrator which plays a vital 
role in automating a task: 

[1]UiPath Studio is a design tool that enables a 
user to create programs.[I]It has many 
activities(pre-defined functions) and repositories 


Feeding Data to 
COCR Enignes 


oF 
OCR Engine 

” Printing Results 

to the console 





Saving Results to 
Local or Cloud 
Storage 
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that are predefined. To model the workflow for 
the automation process, users can drag and drop 
activities. In simple terms, UiPath Studio is a 
method used to model the automation workflow 
to automate repetitive processes using predefined 
activities and libraries. And, UiPath Robots is a 
software that hosts the process installed in the 
UiPath studio that allows us to carry out our 
projects with or without human monitoring (or) 
Supervision on any computer.UiPath Orchestrator 
is a web application that allows us to manage the 
development and deployment of our resources in 
our machine. It allows us to launch and schedule 
our bots on our or other desktops, and also control 
the bot's status and evaluate the effects of their 
work. 


String algorithm Container 


. Execution of 
Algorithm 


Analysts results 





Fig.1. OCR Flow chart 


4.2 Workflow 


This research consists of two workflows, one for 
Tesseract OCR and another for Microsoft OCR. 
A folder which consists of 100 images is given as 
input. An excel sheet which consists of Original 
Text data of the image is used for the calculation 
of the results. The original text data is read as 
string | for comparison on both workflows and it 
extracts the text data using Tesseract OCR on 
Workflow | and Microsoft OCR on Workflow 2. 
The extracted data is given for String 2 for the 
comparison on respected workflows. Now, the 
extracted data by Tesseract OCR and Microsoft 
5. Result analysis 
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OCR were fed into the columns C and D of the 
excel sheet.[]1]And the similarity between the 
string | and 2 (String | is same for both OCR and 
String 2 is the extracted text respected to the OCR 
engine) were calculated and the percentage is 
returned in columns E and F respectively. 
Similarly the time taken for the execution has 
been calculated on the UiPath Studio for both 
OCRs. Finally, the average of accuracy for the 
100 samples and average time taken for the 
extraction of a single image of each OCR has 
been calculated. 
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a.) Mean Accuracy- There is a measure of the 
similarities between the original text and the 
extracted text calculated by using Levenshtein 
Distance Algorithm-String Comparison in uipath, 
and the value is determined between them, and 
finally the mean accuracy is calculated for the 
determined values b.) Overall Execution Time (or) 
Time taken - The time taken for 100 images to be 
Table 1. Characteristics/ Engine 
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extracted including the time taken for the string 
comparison is calculated and tabulated. c.) Mean 
time taken(per images) - The average time taken to 
extract each picture is calculated and calculated 
and tabulated from the total time taken, and mean 
time taken value will be calculated from the total 
time taken divided by total no of images fed. 


Characteristics/ Engine Tesseract OCR Engine Microsoft OCR Engine 
Mean accuracy(in percentage) 71.04549606 76.68340455 


Time taken(in hour format) 00:19:11.00 00:27:08.00 





Mean time taken(s)per image 00:00:11.51 00:00:16.28 


6. Error analysis 


Some datas has been classified below acc to their accuracy percentage, 


Table 2. Tesseract OCR 


"STAY 
HOME 
STAY SAFE" 


"Never forget 

3 types of p_eople 

In your life: 
LVMNVIMJMEOMM 


2.V\m mmhmdmmm 
mummywindrrmnm" 


20 - 40% "Never forget 
3 types of people 
in your life: 
1. Who helped you in your difficult times. 
2. Who left you in your difficult times. 
3. Who put you in your difficult times." 


40 - 60% "FASTER "FASTER 
THAN A THAN A 
MERCEDES MERCEDES . 
MY HEART BEATS MY HEART BEATS 
FOR YOU FOR 
Loesje YOU 
P.O.-BOX 1045 6801 BA ARNHEM HOLAND" 1922/;" 


60 - 80% "YOUR EYES 
PROMISEO ME 


HEAVEN, 


HEAVEN, 
YOUR HANDS 
TOOK ME 
THERE."" YOUR HANDS 
-Lenon Hodson" TOOK ME THERE.""" 


100% BLACKLANE BLACKLANE 
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Table.3 Microsoft OCR 


20 - 40% 


40 - 60% 


"Never forget 

3 types of people 

in your life: 

1. Who helped you in your difficult 
times. 

2. Who left you in your difficult 
times. 

3. Who put you in your difficult 
times." 


"To heal a 
wound 
you need 
to stop 
touching 
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Never forget 

3 types of people 
un your life: 
CCu 

I. In attest betm 
3. in dTAE Sae" 


"To heal 
woun 
yoti 
toucl?inz 
it.” 


it." 
60 - 80% Hello World! t lollo World! 
100% Blacklane Blacklane 


Note: All images were taken in Joint Photographic group format (.JPG or JPEG). 


6.1 Decorative font outcomes 


pu 
A 





T hs Ue 


For these types of fonts, OCR Engines returned null values and also random alphabets such as, Microsoft 


ocr returned null and Tesseract returned “fi‘lAll/” for the value “love” as in the above image. 


6.2 Images with grey background 
Images with grey background provide (attached above) an accuracy of nearly 80 - 90% in both Google 


Tesseract and Microsoft OCR. 


“Kill them 
with 
success 
and bury 
them with 
a smile.” 


“Some people 
come in your 
life 

as blessings, 
others 

come out as 
lessons.” 
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Things end, People 
change. And you know 
what? Life goes on. 
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7. Future works 


[1]So, the overalls analysis and this comparison 
research has been done under three major factors 
which includes, velocity, accuracy, time taken, 
using distance algorithm, majorly this research was 
the improvised version of the previous research 
which gave more precise results over the first one, 
[1 ]but there are still some portions of this research 
can be upgraded for better purposes such as like 
deploying storage area from local storage to cloud 
storage which will be used to manipulate data over 
worldwide collaborators for further analysis,[1]| 
so,moreover the future work will be on storing 
data with cloud support, Which’ll be also used to 
run other OCR Engines available in the UiPath 
framework? 

Conclusion 

By making calculations at different factors, such as 
precision, time taken, to put the experiment to an 
end.[1] In certain cases the results of the Microsoft 
OCR are more reliable compared to the results of 
the Tesseract OCR. But Tesseract OCR also gives 
better results in certain cases as well.This provides 
different accuracy values.But when considering 
the time it took to identify the characters in the 
images compared to Microsoft OCR, Tesseract got 
efficient result,[1]So by performing _ these 
comparison with the use of String comparison 
algorithm 
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